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2 ^ 

K : *IPC*« : — 

^ tit a to : 

(tit) t 

= (tit) 

(£it) LEE LIN-SHAN 
>&^ifc*t: (tit) £*ifr-ir*B;fc*S7#&tf4*58;$.7ft3* 

B& : (tit)t*RH ($t)R.O.C. 

£ > (*_I_A) 

(t*.) 

(**.) LEE LIN-SHAN 

teJ£J0f*.#*L/tfifc*fc : (tit) £Jb^*a**JL7#art#58#.7*;3£ 

S # : (tit)t*RH (&*)R.O.C. 
(tit) (**) 



#b/3A 2 
■■ (tx) 

@#: (tit)t#&@ (^^)R.O.C. 
^ : (tit) flbfc* 

teJ£0f*fe*h ■ (tt) ^ Jb»#jMf4i.*.£ 3 Hfp*.iS5S- 1 «. 80 ft 2 ^ 

aH : (tit)t#R.H (&*.)R.O.C. 

• (tx) i*fR. 

@# : (tit)t¥R.H (^x)RO.C. 



V 



JL * Jt & ft. ffl ' # JR. « * * -fih ^ * # ^ ^ 4* ( text 
or speech queries) ^^^X^i&tt^^^^^iJl (text or speech 
information) tfti#tii^ft - & * 

; ^ *l $t *• (speech-based information retrieval)^ <tt ft ffl # 6^ * 

# ^ # »B t '^Itt^^i^P^i (monosyllabic structure) 

# 44 ' # A A * - * #1 « "I" *P (syllable)^ Jfc aft * SI # «fc 
(indexing terms) • a^Tfifl^t (overlapping syllable 
segments)^ »T W fift 3r -f" * ?P £ "t" *P (syllable pairs separated by 
a few syllables) > |S| B# %c « T i£ - & « -g- tp J* * * #1 £ 
51 # «fc m *fc & ^ & 5& ft «t J5»J jfe ^7 » it ^ » £ # BJ3 «L # 

' VX JSl £ -f # M #J A a >F • if ?$ -L i* it * * *P * 51 

# *t ¥j #r £ m. M it ' 



With the rapidly growing use of the text, audio and multi-media information over the 
Internet, the technology for retrieving text or speech information using text or speech queries 
is becoming more and more important. By speech-based information retrieval, we mean the 
user query and/or the information to be retrieved is in the form of speech. In this invention, 
considering the monosyllabic structure of the Chinese language, a whole class of 
syllable-based indexing terms, including overlapping segments of syllables and syllable pairs 
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separated by a few syllables, was developed. The strong discriminating capabilities of such 
syllable-based indexing terms have been verified. Special approaches for better utilizing such 
capabilities, including fusion with the word- and character-level information and improved 
approaches to obtain better syllable-based features and query expressions and so on were 
developed too. 
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3. . 



1. 

2. 

3. — 



0) 

& #f .J* : 

*. # B ,fl & - « it *l *fr * ^ ' * ^ - * « * * * ^ 
& mi" #f : 

& ft IH Iff- «S & ft # • * tJL iSt it I # * * 

-ft. ffl • S jft. ' at S M s& it it * # *. A 41 # #■ * ^ J ^ *l W 

A3 Iff *W aft- t # & *B * #J ; # 1H, • & ifL # * -fe #r (information 

retrieval technologies) SI ^ H. ^ tit ft & ft it «J ^ ^ * # 

Jfc * -fit ffl & # 6<j jfr iH. ' m itb £ it ft #- *b *L -ft I'J * * ° 

4 I'J SL £ A oh - *L *P ^ *L %t £ #1 m ft « X ^ ^ ^ * i^J 

m 4- (text queries)^- - X 

it ^ ft # * # A * ° it # It ® A tf -fr # * a * # it 

ilfr'ifa^Tit^^^ill^' # *P « -fih ai A 
& i iMl 4- (speech queries) -k^kik^^^^^^i^ (text 
information) - « i ^ ^ ^ W I; ^ ^ ^ (text queries)-^- #r £ t£ 
3! iMj IK. (speech information)^ ttiliSAWittft^ 
(speech queries)-^- iklki&^-Q^&J'g-ifl (speech information) » _L 
it it ^ * & ffl *H A n *fe # ^> « t# -g- A 4- * "^ ^ 1R, 
£ (speech-based information retrieval) • tfL # vi #J » # ^1 
4? S ^ « ^ ^ -I- ^ ^ ft if 1R. -far * « - t * bp a ^ *t ^sl # 
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it 




' i& *r ^ * a & m f& m & _l - ffi m e, a a 




# 


"fR* 




*b # * * * tfj j # tSL 4L j$ - ^E. Ig ^ 6<J •!# : t JL T • 
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* £^ftirm^i^-fft&L;k&^° 3 - 7 S - * 






& 




* 6<J ^ 4A 3, iSt <f (hand-held devices)^ A ^ ^ > PDA 








%&g.#-&m^-$ffixifrtitm ^ t a 1 1 i B t a * 




& 








it 




ft «. «. /fl • ik # t# -fr * ^ 6<J ^ life St # jt ^ * 








• it * *P & A 4+ * « IS- * ^ 46- * & if *l #r t S£ # * 


*. 








6$ # 0 •^*K&#.£;*.#-st*4t*fc^T ' A « T 






-f- 




^ «t m « t# -i- * ^ ^§ 4- * # *■ £ it f- ik. ( m m 






it 














m 7 0 -t & ' # a# «t -ft ffl # W £ ^ ^ 4- * * * 




















* # m a &j m -8- n -t - t m w ' t x. * # t m f • 




# 


t 


X. 






JL 






* ^ ^ * h % o a ^ , ^ t ^ m - # 


ft 










JL 


i 












«■ 


# 








IK* 




^Tii^^i:-^i4#^r^^^^^4- (queries)^ * 








* 


^g. ^ ^ Ul is It (information records) 4£ rt; # 


o 













li^'iS^A^^A^i^^f (queries)^ # - ^ J # 1H. i£, ^ 
(information record°s)-fe jfcttffl^.ifc-fllTJI&^ISI ' * ^ 



5f J* (acoustic conditions) - %% % (speakers) - ^ ^ W I >t (speaking 
modes) ft #. 1R, (background noises)^ #j ^ l»J » t # f S i. 

f § Jl ^ ^ s i • ® iib *f ^ 4 -ft t% 4> & ft- is. ie. * ft "s > * 

+ 

& A & * ft *L ft >® #1 # 51 # «t (indexing terms) « ffl #'J 

it 4 #1 48 4- * ; £ in, *s & ^ W M S A • B ib • *» ^ 4 *l 

* t * t# -g- ^ ft #t * t# -g- *i #r * ' «, * * * w * ^ m 

iLft^^^fc&JL^ — ^^A.tt#3t#r"*ik*#*. 0 ft-& T A 

JUL ii * # * #j # it # * #r i4 A W ¥ • # & ♦ ft # *■ * a -fl 
m * i±s a tf -g- Jb £ j& 1if «l * & #r * # ift W * S 
^ A # in **■ * ( m % * A sl 3$ & ) & ^ W ' ft * * * 
*t 7 # 31* (robustness)^" T « ° 

t x m vx n -g" * ; # i& *fc £ % — « -±- * * ' «. £ 

* it # it # W * 51 # «t (indexing terms)^ |5]B#4&i&ftfl3#4 

3g # a w * 4 #j w # - * * ta. fc m ' «. # •fe n & a w # 

*«^* : — **tf«M*t"W (Keyword)-^ A 

* SI * (keyword-based approach) » j5 - t f J ^ « ^ t ^ ^ 
^ -4 * 51 & J* (word-based approach) - *r # # £ « M in 
Jk & 51 J- * #J >f . .jZ? ^ * & A # * & £ # - * IT *l 
^^.A#-ftLl«!M ^ (keywords) • # ^ ft Jfl ^ t A 6<i ^ ^ ^ 

' ii ^- - *■ ' 
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m & m & u ^ n ^ ^ i& iz & tt ^ v'x & t & & ° *i 7T & 
# # ffi a ' z$r&n&tkikten&M (static^ f num. - 

ift e, m. & it t # «r * <ft t- m le, & #j ft g. • ^ ^ *& *i £ 

£ 43 fig- W 5S- #J 3* ifc T ' it 1H. fc ^ # # <* * it I # St 

St ii # & # £ ft • ffi & M A ■» ft m *t n to. ft * 
& ♦ m H #r *T *q * * ^- * 51 ft # * • * #. ffl * * ^ « 

4* * #r t ft u it m *p * * ft ft « a. * *. * & ( -*r & « t 

x. * A n 6<j 3- ^ M. $L ' 4 # 4- * *l t£ & # *p »T « 

3t«*te-th*Hft&*f££.tt)' ^ £ Ci # * m # ^ x. * Q 

^ f- in *fr *• a #f * T « ± *fc ffl • f& *h ' *r «. a m. m 

it tk m % ft n £ t ft Ik 51 # flfc # >*• ' *3 * *h *3 * 

(Out-of-vocabulary - ^ Pp m 7 ^ ^ IS- -g" If 1ft H- #j fs] * + #r & 

^ ft n • *& + itift*-^*Mft^**.)6<i#^.^'#-A-'ia 

fl«-HJk:fc^**f*#1ft&ti&#ft-£-'<B1*£,&&*J- 

^ i*i * ' * # *f ^ t- *i # £ ^ -r & # * # m & . 

^JIfeH*>4**&^^iif^l8*3**tiftaL^r^**#tft 

^ ii ^@ m • ii -fs m m. m ^ ?! a 4 ^ -s. " tsj " ^. .j> t^j >t 
^ _l it n ± an & 4- % m tt m & & m *± *t & • a * • 
ii * -ft k r ' ii * - ft * * " * 5 i " it - >t ^ • n + ; ^ *i 
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# m ft & : 

^ 4l f- t ' # I t ^ ^ f ?P ^ t (monosyllabic structure) 

# .fi ■ # th - & H « -fr bp (syllable) tfj ft If # 4± * * * & 
Ik 51 # «fc (indexing terms) #1 ^ + #} « If -fr J- ^ 6$ % U 

4LSLJi4<rJ****3lW**J*&^ ° 15] B# ' ^> it - # afe ^ T 
« t x ^ I ^ iU i 4 ^ t ?! # t i f ^ i! ^ f # *J ^ 

It 2& v5~ 5^ : 

I . ft.ffl-8-«P^^«fe"*t # **■ 3£ & 

m z£ & & — ^^--g-pp (monosyllable^ -fc- • + 3- ^ — # 
& ' Pp A * #P £ & * *f I s ! (new words)A ± • *f *l it <f 

" "it #i ^ ^ ^ a t - is *f ^ " € m " ' & 

jft " * " ifr " > " -ft. " #r " it raii^l^^^AT- 

^^Sfe&^-htt^f"**) " Jft ^ * " ° £ £~ # ft T • 

it # *f n ft n * £ •y ^ » a *& « ^ * # im • ^ *b * £ 4 
% % n & ^ & * *& * £ *&&2Lft$L%i}£.ti ( ]ffin%<!L%p%. 

ttBfli^^f^l » «t -ib I§1 -h © #f *. ft #J "f- - • SI A it 
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ft f # * * * >tt *3 * #P t t % ± & % & & & » * # * * " 
6<j 1§] * * • H] otb £ #L ^ « tf -g" ^ & #J t X. ; # *l £ ' 
i§] jfc ^ t§] ^ (out-of-vocabulary)#- jt #j ft i& # j5"J W H t ' & 
*t*A'H-*^#^« + iP^<fcW <fc *t # (syllable-level 
statistical characteristics) (ft £ ?] # «t # * * £ * 

ffi^r-ll^tiis^) ° & « ' <£. t * * it # 6<j 

I- If ii ^ ^ « ^ * f * I- 1 1 i ^ ii ^ w t * • ^ ^ i b 

-0- IE * # # * 51 # «t ' «t T ^ & if *l *■ # * « # 
J# -f" f p >f 1ft. (syllable-level information)^ #1 ^ « !§■ A & 

*& t x. J # m * ' w ^ * * * * * * w * A ° * « t *■ 

fp & * w * t *j- *i - 

-g- ' ft. # t 3c ^ -8- fp it a -fit # 1,345 « ■ & ^ # ^in n 

& - £'] & 4B ^ (A # Sp )#f &L & ft A ' ^ it 1.345 fli -S- ?P 

li^^-fe^pfliif-^lisp^ (polysyllabic 
words) > A 4» ««i * ^ R -Sh £ + "W (*» ^ ^ ' * ^ ) 0 
H jfc ' *r £ #L * ; # «l tit * Nf « & -fill -I- pp flL A #J ft ®- A # 
&&tt&#^tt£i%ft4»&fctitikttiti&ti&ifa' !i ru&. 
# t tic Ik if Ifc ° 

% — if & ' # fll -f- f p ># /£. i£ "HI (syllable-level information)^. 
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$k ^ i® ^ 1$ $l ~ i® & % $r t i% & tti n (morpheme) - 

i&-gjL*ryZ%te-£m3-&]M & ° fft M ' * *t ffi 4= A ^ 
B# ■ifc-W^&^'frddtfS* 44 6<j '|# i£ T - 

X >fb "A ft xfc *9 I 5 ! #J If * • 12- A t n W 3? — #1 * * * I s ) 
#j • 3 - #1 »T »i* Ife # i'J #J 33L $. 4: -4. t ^ * ' — * « 
Pit * l» X A *£ *| , £ ^ fit " # * ±__" & 

% - m * $=.<tt»a#.#.£t — 4B4 E *t'3"«tfr*A " B # 

-fr " o ^ ^ , af Hf- - in & *h B s" 51 A. 6$ t§] (exotic word)^g. 
& *& #J # -fr T « U # A ^ |S| #J *) ' #'J " Kosovo ""sp « 4te 

# A " # * /kel-suo3-wo4/" - " fa t & / ke 1 -suo3-fo2/ " - 
" £ £ A /kel-suo3-ful/ M # * /kel-suo3-fu2/ " > " *t * 

# /kel-suo3-fo2/" ^ ^•4sii*M^#i£#jt§]i£'t3p^#- 

# -8- *p A # 4r «P -I- hp 4rp A ;fe m ft ° A jfc - — <te .ft S 
6<j & * $ & * IU i ^ ^ S t 3: i? -I± 

& 4s * 4* ^ # #j J # in. ie, & *r * 1 5 ] *I ^ * si ^ ^ * 

# - fa m & it n te <ft * * t « *t a ^ * & * a ° ± & 

-Sf- If * ;Jt tb *f-*&4hJt*&#4***t-|-*-*t. <fc <tt £1 M 44 

^t^ifit t " *5 "it ^ - £ ft # *t # *fc A *■ ' ifii JL I s ] 
fl> A is] ^ ^ i£ ^ m A 4b MM ^ » *" *P ^ * - * #1 is] 
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II. & <C & tt 

A . -g- bp J» /i: £ 51 # & (Syllable-level Indexing Terms) 

*■ # B fl H T - $ ?'J « -8- bp (syllable) J% £ aft W # 51 # 
#1 ' & ^ T « ^ I s ! * A #1 t & "I- pp # ft (overlapping syllable 
segments with length N, S(N), N=l,2, 3,4,5, etc.)A W R* # -f -fr *P 
it "a" *P (syllable pairs separated by a few syllables , PS(n), n=l,2,3,4, 
etc.)^ £ 51 # flfc a % • « - im -k Jt Jh 10 ft -g- fp # ?«J (a 
syllable sequence of 10 syllables S, S 2 S 3 .... Sio)-4 i^'J ' # ^ ( * I s ! -ft. 
^ f 4 i fp )i t )?'J i B - ^ -f # ' t t ( ^ f t 

«p ^ « * ^p ) iw n & n - t 4 *p ° *j *» *. a a 3 # * # 

-g- gp # ft (S(N), N=3)& ^ T -Sh «P ^ ft (S, S 2 S 3 )^ (S 2 S 3 S 4 )- (S 3 S 4 
S,)^> ' MK-'IB-g-^4i1t-g-Sp (Ps(n), n=l)# (S x S 3 ), (S 2 S 4 ), 
(S 3 S 5 )^ * • ^ I- t ^ i t W ^ Uli # f ' -L & it g -fr hp 

« #r 4t ^ #J ■ # - 4® -I- 0p ^ ft ^ ( A ) ^ £ * ra « * w 

A + tp t # - * * #a 1 5 ] # > gp & # t * #j is] ^ >$ & jfe *h 

* (out-of-vocabulary) • tf--&-#r-1ft&4fe&-tt1ftibA. 8 0 Jb « 
* A -4 1 #J -I" pp # ft (S(N), N=l)#. # * * 51 * ' & tk £ 

^4t^|5|<f*6<llSI-|-^»*»*'lfffl*A* 1 6$ * fl # ft 

(S(N), N=l)#. ^ f 51 ' £;ifr*a#<fc£-fr#£JL*6<iS&* M 
« • El jfc <fc M # # i* ^ * «. ft £ 51 # #1 * H ° * * -k ' 4fe 
4* X 5,000 -fa ^ J?] 6<j £ -g- fp.lsj 4£ (polysyllabic words). ^ 
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z_ Jl + sx _L ¥j n A 5C * ?M S T • *t * « *& 41 & # * *B * ^ 

#J + • W « ' « * A A 2 #j -I- gp # A (S(N), N=2)^l # A * 

X & -k* -k J$L A 3 (S(N), N=3)£ * * * * 

* 51 £ i *J # 4- # * * #J ; # *L <ft t B# • * it $3 # ^ * 

i w f # f m t ^ £ t ^ ^ t ^ ^ * ° # - & * ' * -t- 

ijt > " m_% iL^ -£ * i: 4a ^ t^^^^^^^, " a 

# -jf&te7J$#.«j#— - % m u %~ %i & - 4® 

«JS-Hjfc*.#^WItA6(lttMK*# J f--S-ff-*.1t + *P (syllable 
pairs separated by n syllables) t 3\ ft 7T & *fc W « it T « # * 
it til W « • # * ' * * * t *S * *t life * t 't * -I" ^ 
ft (substitution » # F P - fa -f- ft $f flfc A £ - #1 -fr gp ) - & 
^ (insertion • j^Pp^t^-fB+SitW-frff + W ' * 6-j M it 
f ifc — -®^#^t^-|-|p)^JL#Jl^ (deletion • # J?p — #J afl afl 
#&tt*»p"£**1ftl#**#)3F-«-**(l#£. ' *• # ^ #T * 
*6<I«Wl»3& J f- + ij!4Llt-8-ff-ft*fl (syllable pairs separated 
by n syllables) ¥j & ft fa & 

& *8r * _L 6<j & $ • J& flj "T ^. ' * -Sh I P (monosyllables)/^ ^ A 

•y & m ^ ^ i% & *v & ft m. ° ft * m n & & ft & ^ 

M M M # l*l -Sh gp #r J. £ * ffl M. ' ^^yxti-k/t^rt- i 
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6-J 4 # 4 ?p # -S. (overlapping syllable segments with length N, N>1) 
ttXWH^fftpiftl-Jp (syllable pairs separated by n 

syllables)/*/? W> A #J £ 51 # ft IS * * ^ #j *& * * *t ° * 
# 4 bp # ^ £ ?! # ft ^ ft ^ £ 4 ep i£ # t£ 

(polysyllabic words or phrases) #j iR. » *f^*fr&$-'tfc^L#f'4 

i#* + it*#.#^i , -l***fr£^'«*i*tei&4*r-iftJ. 

it tfl -ft (substitution) - # (insertion) ^ 2^ M>\ ffc (deletion) # 

# A # -t- ^ - * *I « 4 *P (syllable)^ £ j« ft * 5 1 # ft 
& > «f ft # - J if « * 4 48 4- * # - * * 4 * *P M tf 4 
#r-U&££.*ri&tt4fp#-#tfiL (syllable-lattice) • .& ii 4 # 

&t'#«-«-S-ff«i»fft3l'*##**rlWft4* 
gp (syllable candidates) > ii £. J% T&JK.******^**^**- ' 

«fc ^ it ' ft n ft -t i*. «f # - 4 fe. ^ #f ^ A # * 51 # ft • 
*SI#«tta-it*t** , 64r1 #J W&AfJp«i**it« 
it Mi ft ft ° & 4 ^ « 4- il IT «L te « t 4* - * x ^ 
^^'U^t?!#ft^^it^.**^^^^^.^4^^4- 

# T *■ # a /J #J - & ?'J ^ 4 fp (syllable)^ * * W * 51 # ft 
ffl*-«**44#i*l4-**-*it4fc*'Jl , J B tH" * £ # 

-iX^^^^fiH.^'t (text-based information retrieval) & iS& 4& 
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MiftitilLfcik%L*2. (information retrieval models)-^ 3p |S] it i& »T 

^X^5!^Wft^t^(5]f§ra^^ (vector space model, 
it A fit t tiSi X * 51 ^ # jjf *l ^ #J A 3|S & #J & m ) 

' £ it ft ^ T ' * *fr if *l te * * 4 *S 4- ^ x * SI ^ 

# * it -I- ^ ^ • *p »s* « it - & # «. i#j i & it •& n • * 

+ — (component)^ & £. — & « 4 bp A & ^ 

* 5i # at & a* t b# ft ^ ^ m • m n n > « *■ # 

(S(N), N=l~5, and Ps(n), n=l~4) » *t — "ST « ffi 9 |#^itf 

-ft * # - * *L * & * - J M 4 *b « 4- • flj ^ *i to it * 

4 # 4- W * nfl .|4 «t « 4 *b & 4- & # - * *L is «fc «i 

it 9 IS # ft * (ft -f® #J Jfc «f & 481 # 

B . 4 lp - z£ & n ^. ffi % ft ^ M t><] %k & (Fusion of Syllable-, 
Character- And Word-Level Information) 

?# 4 & ^ #J ^X-'j^rlR.^^: (speech-based information retrieval 
for Mandarin Chinese) t&.&4tft&#J*&Mffoj] ' * * *l ># <fc 

_l #j *i a *r ^ * * •y 4 fp #r & # # m *h m - 

*»'F34 i £*|-A*!>tefcl4*p #r m ± ft » M *E T * 3= ># <fc 

_L «j iff" *L /If * • IS] I'J * # « 4 Ip Jt A * «■ f# * ; # *l • 
-fa ^ - 7T A • « * A n ft. ^ ^ * 51 # «t ^ ^ tt 4 -4 * 
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n t. si & m M a i% ' m & it t *t ** ^ -s- bp - ^ i*i ii j= 

* * IS] /# ik. ft ^ iH . If) ,^ t t f ?f f f i 4 ^) t X f 

in. 4fr * -fr * #f j& ° ft i§i # it -I- lp ># * ft * ?| # «fc • 

7 ^] -fc ^ ^ f # ^ ^ li ic f # li (C(N), N=l,2,3,4,5, etc., 

and W(N), N=l,2,3,4,5, etc.)** ftffi& s F*&is\*-%L*&1f&l 
(P C (N), N=l,2,3,4, etc., and P W (N), N=l,2,3,4, etc.) • — & 

* f- *t tz m m w*M«LA«t-sr«ffljiit+«f!-** 

*|it=.#>t^:W* ?! # ft -fa J5»J W # f r^j f 44 tb # & 

^ ^ i ^ i if # • 

C . & ^ fl- & 4£ ? I #j % f I # $t (Data-Driven Indexing Terms) 

X3&yJ>^F\-kJjt¥if[&'1t1$MifSL (overlapping syllable 
segments with length N, S(N), N=l,2,3,4,5, etc.) > ^ >f «. ^ iq # St 

A *. - «■ it # * ^ te -ft tt & * # # £ * • # R£ H ^ Bf ^ 
4fc * ft <ft flf 4& fij ° zfc it 6<r 3r * • «r « ift - # *J ffl & if ft 
>*■ ' m € ^ a A U ft *fc #L if # # ( #'J *» 0f # ** «r * 6<J J # in. 

ie. m m % a % & ) t # a ^ *t & jl t# * & & # -i- ip 

*^*4tW*l?^A(iil*>tA ' # «. ) ± ^ #J fi£ - n -to 
&$P$&& t ^ft & " A S /jian3-pu3-zhai4/ *' (S(N)j& C(N), 

" j|L it /jian3-pu3/ " " it & /pu3-zhai4/ "(S(N) ^ C(N), 
N=2)^ m ir m if * ^ * 4t • i) .ft i4 #: #J Kfe- - it « ^ jjf # 
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5| (data-driven) fa ^Tft**ll!*6(l*5l#«l ' * 4£ ^ « - 

t % J*. 4* ^ • 4ft & *a I s ! & ffl * * »p " * ^ ts J — ^ * ^ * 

?] # «t 0 n # ft * W ' " *■ * *k " *■ "I*- 7 ^ * 

^ " & & " — « I?] w m A W *§■ * & «■ #j * ^ # ft ' A m # 

6$ * 5 1 # «fc • -fa " m & * " *t " Ttf **■ " * « 

1§] • is it & % %% * i£ * * 4£ ' * A - #1 «. * & W £ *l 

# ft > & t t ®k & & ^ *~ ' I'J T « t ° ii * & # *r J* 
51 ft * 51 # $t (data-driven indexing terms)^t M. £. 3~ ' « -fr 

pp ,# ft £ 51 # $t ' -T #7 4r ft * A 1 ft -0- Sp # ft 
(S(N), N=1)M jfe ' *X & T Ha _L (bottom-up) ft ^" ^ • it ^ ^ 
3c ' it * # ft ft it -g- Ip # ft ' -^tt^^ftit^A 

* A ft (N=2,3 * )*r ft -fr 0p # ft ' # ^ ft & 4* * * 
*^«4t*fl-*(*»J*»W****6<i*"*' * ^ A ft * 

^ * ) t *@ it ft -S- Ip" # ft £ «■ « if # * t ft £ # ft if 

<t ' iH -to* it in Hi A. W! #j ft £. ifL .1? * (mutual information)^ %% f 

4& ^! #- *t (language model parameter)^ ;fa $L A -ft # 

ft ft "it & 44 ' A & it M fa ft ft if & <t ' # * * I s ! *■ A W * 

?| # «. ^ ^ 1^1 6<j it d 0 «t ^ 0 # i% -fH #1 it #j -f- gp # ft 

*. * ^ if *t <i ^ ^ m -ft d 0 ef - i$. *s ft m & & - 
« ^ a *f ^ ^ pp >i ft ° jfc - J. 4. # « « t is m a jsl a it 

j&mfflit&±°m&tt&fc&&ft&&£& > £ftfc^^w 
& & -li % jl * * it #j ^ >t ft * n % ft * ° 
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D. -g- fp >f #j ^ -|- t& (Syllable-level Utterance Verification) 

* -jt + *t «L (syllable-lattice) t ' - ffi -g- sp 6<3 t# -g- 

& $ 0f # # 6<j ft it + BP *t § & 1 f'J m H# • I'l f If 

Ip X -S. (overlapping syllable segments with length N, S(N), N=l,2,3,4,5, 
etc.)-& ffl Rli 3r -f- lp ^ -f- gp (syllable pairs separated by a few 
syllables , P s (n), n=l,2,3,4, etc.)#J a *t 4T M % to 

m m N * m 2 Z_ £ • 3|ttfli:t^rttt-«f^i 

^T«*l^5t*6(l48t-*^1R. ' m N -l & m 2 -l « * f| 

^ t- a t ifl: « i ii t ^ ffl ' « FM& £ ?l *t 

4II ^ & i& ;t ift I?sMJl (pre-assigned threshold) B# > £ £ #j £ f| 

# ft *t f « it *i * ■ •r«^^i^?i#tn! : ' n # - m * 

5I # ft ^ (SJ #j ft <t #j «t ° 

E. fa Ik ?I # U. #J AW fi£ (Deletion of Low Frequency Indexing 
Terms) 

"T vx til # ^l#li#^tilili % « 4& #J -fr Ip M 

B# • «. T ^ « #J fi£ • S jfc £ B £ t ' £" 51 # «t <Hj *fc it ^ 

ft ^ « I i ^ A ^ - « t ?l # t fJ ;^ ^ ft t • Ji it f 4 
-fq- bp # -S. (overlapping syllable segments with length N, (S(N), 
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N=l,2,3,4,5, etc.) A WlSb#-f--g-«p4i1£-8-§p (syllable pairs 
separated by a few syllables , P s (n), n=l,2,3,4, etc.)^ # — Ik 51 # $t 

* A -4 2 (S(N), N=2)Jb fcl ' # - « & m 

*«P&^A«i + «P^-ft(s k ,Sj)*rfssi<fcifc^^--'ttlP* 

* £ #J N] li r 0 # > ^ -sp #J fife- •£ « if it * Si ^ ° [5] & 

*fc ' # # - m ik ?! # at *r « ^ m w m & #j «t ^ • 

F. & & $f * 51 # #1 §^J #J ft£ (Deletion of Stop Terms) 

* JL ± -fr i p * 3=**ltt#5l#*t8f' *r 49 JW * 51 # 

f 4^ X # U t f I (Inverse Document Frequency, IDF > it A — M. 

tit>f£jL&&M&5\¥f$k n * (stop term list) • ii m A & * * 

m n ti & t 51 # & • *i " w " " & " ^ ^ 4h * * * 

* -f- «p *. * A 39L £ - ^ 3r 1H, fc « t ' *fc % 4r >'l * * 51 *h 
& • Ejfc#:fr#-#i-8-$p£5!##: ' *l *» * £ -0- ft % A 
(overlapping syllable segments with length N, S(N), N=1~5)>S. M ^ f 
-f- -|- §p i£ -|- Ip (syllable pairs separated by a few syllables , S(N), 

n=i~5)^ - 3p -r me: 3- - in m $ m t 51 n 4t • i >i i i t ?i 

51 #«*ife38,4Lft*#* 51 *J * t 6<j 
TIT Miif f ±t^it?l#f (*^P IDF ft *t # * * )«. 
#^t^^t#Ji^°ii^M^<i^- 5 r^^.#--i^^5I#^^ 
IS: £ • 

G. % %B ffl ig 4£ (Automatic Relevance Feedback) 
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St A H — ;Jt ^ * b# it ^ — ^ £R ^; ^ * 5'J * 




















(relevant or irrelevant information records^^T J?l ilL il 4ft 

\ X W X W V t* XXV V X XXX w X V C4- XXV 1111 V_/ X XXX H- V X W XX X V V< V/ X \-A kJ / ^ » y 1 J >^J ^ p_j 


it 








^ * ' ife — IS 4£ JH ^f" If Ift .L Jl jE M # #t 










>fsr o fi >ft ig M 5g i# * fc II — ^ * S'l #j > "^T 






ft 




* *a Jfe #j ia M ijf fe & + ^ * je #j * 51 # in 




ft 


/ '4 








t 

1 






* ft M 6<j IK. i£ ^ t ^ A St "ft 51 # «. ^ ft. ffl 


# 


w V 












J 1=1 


< 




* 


















H . 51 # $t i% £g (Term Association Matrix) 










*»*i^'IB*fl#*t , l|f 1 frP3B^thSl (co-occurring) 


ft 


m 




iR. le. ^ (information records or passages)^ » 






"T 






» 









AtH!Tft#4*««Ai40WA«|it -14. (synonymity 
association) • ^ ^ si 6<j ^ ^ • T«#e.-i&-#fc#T##jfr#lte& 

«t Bfl it ^6 I s * ' ^ Rfl it t # - « 7L ik a(m,n)^ & % & 

m m £ 51 # m. t m ifo t„|S|i^tbst4E.fti*l*-«LJte«tJK 1 «**(i 
« * *fc "if # • H Jb <fc ^ 4i ^ it i% « t 51 # «. 4l HO * * 
^ it - *J *» • ^ M it *E t * - « it ^ a ( m , n ) «ft A 1 • 
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fe ^ jSE. «. & t ' 0 oib - ^ *r # * * tfj ft & M it *t ; # Ml it 
t * - « it * a(m,n)<Mj it & 0'^T«fe^4t#*f!#«. 
t m t n #L & # 1*1 b# A £ }a |5] 6<j ; # *i to 4t jBc. «. %■ t > 

^T^A^^Ba^^ 0 ^^'^ *i «. »r « *a * & m & w 

«i*#4-t SI # ft m & m it «I4 *. ^ l « * 5! 

1T #J ffl 

if #■ Wt W 2 • H 2 # 91 - ^ & 4l a H • * t *t 

&7±&u*m&j£tt±&*ft/*/iQfi&£ l teM>& j f- 
-g- ft i * in ^ ft + ?p / * /*? w 3- * it 4f ^ * * 

# at m it f# > * ?i #i # & % m t 5i «i i*- - m ^ » 
& + ft i * / n * f I # at 3 *b Bfl aa « * A * # ^ ^ « 

« ± il f ^ * f t t t ?fe • ^ ^ 3^ t f I- #'J 

H £ j(|f Jp. ift ^ : 

m l -4 « * ft n S, S 2 S 3 s, 0 AWtij&HL-frftMr 
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l . - m t if in. ^ £ if >£ ' & ^ : 

$r >w & & #r «t 4 Ha it *t ^ Ig- -g- ic 4= & $j 48 4- ; 

J. 31 i«. * ' 

2 . t if 4 *J 16 SI * 1 t X It *l * * * >* ' * t & 

# £ -ft. & ^ — ° 

3 . -ku t i* 4 *I ia IS * 1 *S ^ t x ; # 1H, *fr £ ^ * • * t 

# £ -ft. A * 3. ° 

4 . - *fc tt if -fr 4. & * t x f- *L *8f * 3r ' eL ^ : 

a. & it #r «t it *J it *L *S- -I- 4, x ^ £ ^ h% 4- : 

J. si iff- ih. la k > 

* t t i ?l # t f- 4 - ^ ^ I ^ - f f i ° 

5 . *»t*fr4*ll£B# l ^It t • *ti 

# £ *. A ^ ^ * *s ^ jl 5. ^ a - ° 

6 . *» t tfr -#• *'J IS ffl # 1 * 3. ; # «i *8r * # ' * t iz 
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7 . 




t 


if 


4 




4a 


IS 9 


4 




t 






*8t * 






• * 


t 






& 


51 


# 






T A - 


ffl 




-f- 






^ ° 












8. 




t 


if 


4 






S * 


4 




t 


X if 




to * 






• * 


t 


it 






5] 


# 


«t 




Jb - 


w 






















9 . 




t 


it 






4a SI # 


1 


, 4 > 


5 


» 6 - 


7 


£ 8 










iR. 












» 




t it 


t 


51 # 


ft 




it 








- * 


0 




10. 




t 


it 






4a 




1 


, 4 - 


5 


> 6 - 


7 


8 










"iHi 












» 


* 


t it 


* 


51 # 


ast 




* 




Bp 


>t 


ft - 








/Mr 

BP 






ft 














#^ > 








*I 


m $3. 










ft. 


t 


it 










o 




















11. 


■kv 


t 


it 






4a S * 


1 


. 4 - 


5 


> 6 - 


7 


A 8 






t x 


; # 






fa 


* 










t it 


* 


51 # 








• it 


t 




J # iJt 




* 














































it 


it 










t # 




* PP 




^ A 






it -0- 


ft 






A 












-fa « 




4fe it 


-fr- 


gp > 












«■ 








-g- 


bp 






* 






& ; 




























%% 




* 






t & 




■w BP 




^ 






it -g- 


ft 




















Ji 


tfi it 


-fr- 


BP " 




A in 
















-1- 








* 






*a ; 




t it 




Bp s 




*, 










t 








it 


-fr- 


HP % 




* in 








it -t- 




it 




















it 


o 
























12 




t 


if 


+ 




4a SI 9 


11 ^ i: t i i 


f t 


H *8f * 7T ' 




it 




t 




# 


at 


% 


tL 








> 


JL it 








it 


% 51 


# 


«. 




ft 






-si 


m 


^ ^ it 




ft > % jfc iS] # $f 


it 




It -f- 




•Si 
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13. -k* t ft 4 *I la SI 35 1^4^ 5 > 6 - 7 & 8 J| ^ t X f in 
*fr # 3r * ' * t « «■ + A & * ^ t ; # in. £ « & ^ 
* * *$ A # ± Ha & 4- * ^ * ^ ^ in, *a 



^ jot. % -g 



14. t it 4 #'J la 9 # 13 ^1 ^ t x f i?l t t 7 
£ tfj # 4- A ^ *l ^ « * ^ * 5- * # ' 

& # ^ it « & n£5ift®i&&^ : $i&&#j 

A f" *L if t A SI 6<j * it • 

15. *> t *fr ♦ *I IS 81 * 1 * 4 > 5 » 6 > 7 & 8 *jf 

ft £ if * ' 3 &*A#-£tf&#4*A#-* 

If - a. # «. ft * • ^t#-#f^f&it 

t # - * fi # «t ^ £ *b m 4* * * *i « + 

m ^ it # # # ^ it ( # a *t -t- s 3, J. * ) *, & 

id 1 5 ^ ^ t ^ ih. ^ £ if 

^ ^ & ^ & # - f in. le, & £- # «t 6 * 

n & # j* m -a ' 

17. -kv t tfr 4 *'J II SI 9 1 * 4 ^ 5 r 6 ' 7 * 8 ^ i: 



£ it) & 



t 
# 



t x ; # *l 



-fR. ie, & 
; # UL # 

• * 

ft It 

m m 
t *. t 



-fill J#J 



171 
# 
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v J 

ft - &l £ 51 # $t ^ & * A 1 -fr *p - * * *l # «. ' 
Mtt - J'XiiiTfti-^^^ ' * #8 #P &j -g- ft - ^ £ IS] # 

& > jl « it * & * ^ -g- >p > * a n n a - it ft & 
t 4l - ft -it *t ft - ^j^^^^^^^^-t-fp * % &n 
n ^ « & ^ « ^ a *f ^ * # ft ° 

ia t t -I la I I l7^|4it*.ir*L«f*^r*»*t 

19. 4p t it -#■ *I «. SI * 17^^t^'^^^^^>***t 

it 3 - * A * -Sh pp * * * ts] -Jfe Jt. A 3 • 
2a ^ t f -I ^'J B 1 I 1 7 J f t X % Hi tk * ' * t 

**fc-tf-it'flL"*rib«T«*ii^A^ - * a *t w -g- 

<Mj *B S. ifl & * 0 

21. *li£K* l 7 ^ 3i t ^ IT tR. & £ ?r & ' * t 

tt*fcit#tt"*rJfct*"3">x#i4#j5& 1 3 — * Jl *fc * w + 

Sp ' ***H>tA4<r*'«4ft»h*lfS * * it *I it -ft 4fc * W 

22 t *fr 4 *I l£ ffl % 17 ^^txfiR^i^^ • * + 

f liii/f #lf ?Ut?l#ti^«t • & £ £ £ 

I* ^ rfj 48 #a it 4£ * -I- pp " 4= ic ^ *t A. « ^ A 3 - -ft. 

A -ft. -g- tp * 4=ifc^#A**fEJ&#r<ft£ 51 #f »f ' 

#. «■ * ft * A #j -g- sp - * * *I it «. £ 51 # «t * * * R 

6<j fifl <i • # it ft *t <i * ^ HQ <t af • «. # it p& * 
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23. *» t * 4- *1 la ® % 2 2 H 4l t x. J # *l *fe- * • % t 
*f • £ £'J & # 4£ i£ #j -0- ip " * * # A. 6<J ift If & 

24. t if 4 *J 4a IS * 1 1 *M t X f- «l *fr * 3r ' * t 4- 
4fc it -I- - ^#*5iL***li*a-*fc#«,;*--fli&*fc£ 

6<I ^JL Ht ' ^ it * « P * ^ ^ -?§J «. ♦ * #J f£ » 

25. t Ht 4 *J la IS % 12 Ittxf iR^t7* 1 *t 
i& & 3] & 4S(,iL^Jt#«.^-fll* 

2& *» t ifr 4 *J 4a IS * 25^f4itit*-*l«t*^*'*t 

£ » i*|S|6<i#5l#*fc-g - *tJt*|S|6<r'tt • 

27. t *I |a IS # b4>5^6^7|8^ttxfiH 

ft A - & A « * 51 # «t ?'J * ° 

2& *» t it -* *I 4a IS % 2 7 *Jj t X. if IK, & t 7j & • f, tL 

iiui#t ^jf t»m*€iife^* t f i # m. n 

& t 60 M & -f- m 4t * tb si 60 £ 51 # «t • 

29. #II&BJ# 1 - 4 - 5 > 6 > 7 8 ^ t it if 1ft 

*fc # 7T ' 5 & 5! 51 #«t 

M it *E I* • ■m&.&tL&g^M&.&Tl-M: ' # - *E P# 7C 

* ft £ -ft i% ffl * 51 # #t 13 & J£ £ ft 15] <tf> iT & te & 
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" » ^ 




t # #■ ft if # 4± • 
3Q *»t*4-*JI£a* 2 9 ^ 4i t ^ ^ ifL ik t ^ Or ' £- t 

itTt^^r^,^^ o # i ^ n 4l # ^ it -a • 

31. *> t *fr * *'J & ffl # 30 ^^t Xf IH^t^* • *t 
32 t if 4 *l & m % 30 ^tt ' * t 

m it % Jb i T<ft**'is#5i#*tt*isjB*ifafli£*ira 
tft ie, ^ t ^ * # * % & m ^ ^ • 

33. *1JSH# 32 fttxf ttt 7* ' ^ ^ 

W#t^if t ' « f ^ ^ - t §<j i ^ ^ 4^ # t i^i i ° 

34. t tfr 4 #1 $1 IS # 1 ^ 4 ^ 5 ^ 6 ^ 7 > 8 > 12 I, 14 ^ t 

«t i ^ « ts- -I- ^ x ^ ^ ^ j. si 4i ir m te, « ^ # « 

' it 4f - % — & * ° 

35. t if 4 *l £ B * 3 4 ^ 4i t ^ IT IK. *&• * ^ * ' £• t 
t£ 9 — ik #r * *T * * 51 # & * #J Ffr & 51 # «. • vx 

36 t * 4- *I H S * 3 5 *jf 4i t ^ If *L %k £ 7T & ' * + 

4i IT ^ ^/f # 4i *b m ^ iK. iz m A * *a M f in i& f 
t « *J iff » 

37. 4a t #1 £ SI * 3 6 ^ 4i t * if 1H. * £ zr - & t 
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^tt?l#tftbt^^lr^i^f#^^ M it *i 
la t • MJt*»«*5I#«t*.*^*t« 
3& *r t * * *U1 ? 36 Jfttxt t^t** ' *t 

in. ia & t ' U'J *i i*- *£ # ?i # m. & f£ «, * a- it • 

39. in t 1* *I 3& B % 1 1 *S + * *i «f * * * ' 3 & ^ 

3, S. SI il if Ul la it # « ' it *f - % — $t * ° 

40. ia t * #1 *£ SI * 3 9 ^ ^ t ^ ; # m. ^ Jr & ' * t 
t* % - # ?fc -T * £ SI # «t jfi, m fk t SI # «. ' « 
J. £ 3 - *f W £ It 1% 4> # «fc ft * *» « & H 0 

41. ia t If *J 31 g| $ 4 0 *S ^ t it ; # *l *fr * >*■ • * t 

4LTfr;ffr*J*f*£#^*i M Hf IH. la Sft iK. ^ 1$ ; # *l la it 

t *» « £'J if 8 

42 ia t I* 4 *'J <£ m % 4 1 3f4Lt*.**Hfr*3r*'** 

# tk £ SI # ft t £ si ^ n & % m « # ^ # M I*- 
ia ^ t ' m 3**»**#3i 

43. ****** *JI£ffl£ 4 1 ^RiLt^irtH.*ft-*^r*'* + 

# it * si # * & m, & n t m nt # ^ * * m % 

HL la ft t ' Mflirfrtt* 51 

44. ia * ft ± *| & B % 15^^**3rm#T£2T>£'3& 
^}^J.St-^ ; #lH.la^:-^#^^ ' i£4f-# — ° 
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45. -ft* t W 4 *'J & B * 44^4Lt^ ; #iR.^^^^'*t 

' « 9 — & £ -T * *t t 51 # «fc ifi. *I I*" * 51 # «t » « 

46 t *fr 4 #'J & B # 4 5 *R t * ^ *l #t * * * • * t 

* 51 # m ^ *» & m i& *r * m & 51 # at # a si * 

fa *x n% ° 

47. t if 4 *I la HI # 46^4ititT)r-*l*Jr*^r*'*t 

4& *I|6B* 46 ^1^ t x f i ^ * • * t 

^ t t ?i # t t * ^ ^ ^ t ^ t t # ^ ^ ft n f 
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(Overlapping Syllable Segments with 

l_^CngLll iV J 




S(N), N=l 


(si) (S2) Jsw) 


0(i Yy, i Y — Z 


/c, e 0 ) /c-, cJ /c n 


CAT) Af= ? 




0(ivy, iV 


{ 0 / 02 ^3 J 4/ ( J 2 0 3 0 4 0 J/ • ♦ • { 0 7 °o 0 y 




A" / Co ?3 9c) A-i V3 .9/:) fe< S7 Sa So S in) 


(Syllable Pair Separated by n Syllables) 




P s (n), n=l 


(SiS 3 ) (s 2 s 4 ) ...(s H s, 0 ) 


P s (n), n=2 


(s, s 4 ) (s 2 s 5 ) ...(s 7 s IQ ) 


P s (n), n=3 


(S, S 5 ) (S 2 S6) ...(S 6 S, 0 ) 


P/n), n=4 


(s, Sg) (s 2 s 7 ) ...(s 5 s w ) 



m i 
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